Abstract

In an earlier paper an algebraic characterization was made of the problem of resolving closely spaced plane waves incident on a linear array. The characterization suggests several data-adaptive processing methods and encompasses the Wiener, Maximum Likelihood, and Pisarenko methods. In this paper, the algebraic approach is amplified and the results are extended to consider correlated noise. A recursive algorithm is given for a particularly effective processing method.


An important array processing problem is that of determining the directions of propagation of plane waves incident on a linear array of uniformly spaced sensors. Contemporary spectral analysis has led to the development of several array processing methods that are able to resolve plane waves with nearly identical directions of propagation. These methods include the Wiener Prediction Filter method [3], the Maximum Likelihood method [3], and the Pisarenko method. This paper amplifies and extends an algebraic approach given earlier, based upon an algebraic characterization of the array processing problem. The results encompass the methods mentioned above and include the case of correlated noise. A recursive algorithm is presented for implementation of a particularly effective processing method.

Model  of  the  Array  Data 

Consider the complex sinusoidal time-space plane wave $f(t,\mathbf{r})$ as represented by

$$f(t,\mathbf{r}) = A e^{j(\omega t + \mathbf{k}\cdot\mathbf{r})} \qquad (1)$$

where $A$ is the complex amplitude, $t$ is the continuous time variable, $\mathbf{r} = x\hat{\mathbf{i}} + y\hat{\mathbf{j}} + z\hat{\mathbf{k}}$ is the continuous space variable, $\omega$ is the (temporal) frequency, and $\mathbf{k} = k_x\hat{\mathbf{i}} + k_y\hat{\mathbf{j}} + k_z\hat{\mathbf{k}}$ is the wavenumber (spatial frequency). This wave travels in the direction of $-\mathbf{k}$ with a speed of propagation $c = \omega/|\mathbf{k}|$. Let us now monitor this wave with a linear array placed along the x-axis, whereby $y = z = 0$, as shown in Figure 1. The detected signal is

$$f_1(t,x) = A e^{j[\omega t + k_x x]}. \qquad (2)$$

From this ideal data it is theoretically possible to determine the values of the parameters $\omega$ and $k_x$. Furthermore, if the speed of propagation is a known constant or a known function of frequency, we can then calculate the wavenumber's magnitude from

$$|\mathbf{k}| = \frac{\omega}{c}. \qquad (3)$$

Because we do not have complete knowledge of the wavenumber $\mathbf{k}$, we cannot unambiguously determine the direction of propagation. However, we can determine the polar angle $\gamma$ associated with the wave as

$$\gamma = \cos^{-1}\!\left(\frac{k_x}{|\mathbf{k}|}\right). \qquad (4)$$

This angle defines a cone whose central axis lies along the linear array. This information alone is sufficient for many applications. For example, the wave may be known a priori to be traveling in the xy plane. This is the case we shall consider hereafter.

Figure 1. Linear array along the x-axis.


We have reduced the problem from four dimensions in the variables to two dimensions, through the constraint of a linear array. We can further reduce the problem to one dimension by noting that time and space are independent quantities in this model. Thus we can perform our analyses for $\omega$ and $k_x$ separately. Ordinary time series processing such as filtering or spectral estimation can be applied to each sensor to first determine the presence of signals at a particular temporal frequency $\omega$. These outputs, one for each sensor, are then spatially processed to determine the directions of sources radiating at the frequency $\omega$. Hereafter, we shall suppress the time domain and consider only the spatial dimension [3, 5].

Figure 2 depicts a linear array of $p$ sensors uniformly spaced $d$ units apart. A plane wave is impinging upon the array with an incident angle $\theta$. Noting that the incident angle is complementary to the polar angle, we have

$$\sin\theta = \frac{k_x}{|\mathbf{k}|} = \frac{k_x}{\omega/c} = \frac{k_x}{2\pi/\lambda}$$

from which it is seen that

$$k_x = \frac{2\pi\sin\theta}{\lambda} \qquad (5)$$

where $\lambda$ is the wavelength of the plane wave. Defining our origin at sensor zero, the $n$th sensor will sample the wave at the point $x = nd$. Hence, at any particular instant in time the array output is, from (2),

$$y(n) = f_1(nd) = A e^{j[k_x d\,n + \phi]}, \qquad 0 \le n \le p-1 \qquad (6)$$

where $\phi$ is a phase angle dependent on the sampling instant.

Figure 2. Plane wave with incident angle $\theta$.


The set of $p$ instantaneous spatial samples (6) as measured by the array is referred to as a "snapshot." In this case the snapshot is a sampled complex sinusoid whose sampled spatial frequency is given by $\omega = 2\pi d\sin\theta/\lambda$. Clearly, an estimate of the sinusoid's frequency directly yields an estimate of the direction of propagation of the plane wave if the wavelength $\lambda$ is known. As such, spectral estimation is seen to play a prominent role in linear array processing.
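The relation between spatial frequency and incident angle can be sketched numerically. The snippet below is only an illustration: the function names and the numerical values (half-wavelength spacing, an 18 degree source) are assumptions chosen for the example, not part of the original development.

```python
import numpy as np

def spatial_frequency(theta_deg, d, wavelength):
    """Sampled spatial frequency omega = 2*pi*d*sin(theta)/lambda (radians per sensor)."""
    return 2.0 * np.pi * d * np.sin(np.deg2rad(theta_deg)) / wavelength

def incident_angle(omega, d, wavelength):
    """Invert the relation above; valid when |omega*lambda/(2*pi*d)| <= 1."""
    return np.rad2deg(np.arcsin(omega * wavelength / (2.0 * np.pi * d)))

# Hypothetical numbers: half-wavelength spacing, one source at 18 degrees.
wavelength = 1.0
d = wavelength / 2.0
omega = spatial_frequency(18.0, d, wavelength)
theta_back = incident_angle(omega, d, wavelength)
print(omega, theta_back)
```

With $d = \lambda/2$ the mapping from $\theta \in [-90°, 90°]$ to $\omega \in [-\pi, \pi]$ is one-to-one, which is why a frequency estimate translates directly into a direction estimate.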

We now generalize our model to include multiple plane waves incident on an array in which the sensor indications are contaminated by white measurement noise. If there are a total of $q$ plane waves and the $k$th plane wave has a direction of propagation $\theta_k$, it follows by superposition that the snapshot will have the form

$$y(n) = \eta(n) + \sum_{k=1}^{q} A_k e^{j\phi_k} e^{jn\omega_k}, \qquad 0 \le n \le p-1 \qquad (7)$$

where the $q$ sinusoid frequencies are given by

$$\omega_k = \frac{2\pi d \sin\theta_k}{\lambda}$$

and $\eta(n)$ are uncorrelated zero mean random variables with variance $\sigma^2$ that represent the measurement noise. We assume that the $\omega_k$ are all different.

Our objective is to estimate the frequency parameters $\omega_k$ using these snapshot measurements. We are particularly interested in the ability to resolve, or distinguish between, two plane waves with very similar frequencies (i.e., $\omega_1 \approx \omega_2$). This estimation capability requires the utilization of a number of snapshots taken sequentially in time. Our data then has the form

$$y_m(n) = \eta_m(n) + \sum_{k=1}^{q} A_k e^{j\phi_{km}} e^{jn\omega_k}, \qquad 1 \le m \le M,\quad 0 \le n \le p-1 \qquad (8)$$

where $m$ is the snapshot index and $M$ is the total number of snapshots. In this model, we assume that the phase angles $\phi_{km}$ are uncorrelated random variables uniformly distributed on $[-\pi, +\pi]$. Their random nature arises from the independence of the sinusoidal sources and from the approximate randomness of time-sampling far below the Nyquist rate.
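The multiple-snapshot data model can be simulated directly. The following is a minimal sketch under stated assumptions: the function name `simulate_snapshots` is hypothetical, and the noise is taken to be circular complex Gaussian with total variance $\sigma^2$ per sensor, a choice the text does not specify.

```python
import numpy as np

def simulate_snapshots(amplitudes, omegas, p, M, sigma2, rng):
    """Return an (M, p) complex array whose m-th row is one snapshot y_m(n)."""
    amplitudes = np.asarray(amplitudes, dtype=complex)
    omegas = np.asarray(omegas, dtype=float)
    n = np.arange(p)
    # Column k holds the sinusoid exp(j*n*omega_k), n = 0..p-1.
    S = np.exp(1j * np.outer(n, omegas))                          # (p, q)
    # One independent phase per source and per snapshot, uniform on [-pi, pi).
    phases = rng.uniform(-np.pi, np.pi, size=(M, omegas.size))   # (M, q)
    signal = (amplitudes * np.exp(1j * phases)) @ S.T             # (M, p)
    # Assumed noise model: circular complex Gaussian, variance sigma2 per sensor.
    noise = np.sqrt(sigma2 / 2.0) * (
        rng.standard_normal((M, p)) + 1j * rng.standard_normal((M, p))
    )
    return signal + noise

rng = np.random.default_rng(0)
Y = simulate_snapshots([3.0, 3.0], [0.9, 1.1], p=8, M=50, sigma2=1.0, rng=rng)
print(Y.shape)
```

Each row of `Y` plays the role of one data vector in the vector notation introduced next.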

It is convenient at this point to represent the problem in vector notation. We represent the $m$th snapshot (8) by the $p \times 1$ column vector

$$\mathbf{y}_m = [\,y_m(0)\ \ y_m(1)\ \ \cdots\ \ y_m(p-1)\,]' \qquad (9)$$

and define the pure complex sinusoid vector as

$$\mathbf{S}_\omega = [\,1\ \ e^{j\omega}\ \ e^{j2\omega}\ \ \cdots\ \ e^{j(p-1)\omega}\,]'. \qquad (10)$$

Lastly, the noise vector associated with the $m$th snapshot is defined as

$$\boldsymbol{\eta}_m = [\,\eta_m(0)\ \ \eta_m(1)\ \ \cdots\ \ \eta_m(p-1)\,]'. \qquad (11)$$

With the above notation, we may compactly represent the snapshots by the data vector equation

$$\mathbf{y}_m = \boldsymbol{\eta}_m + \sum_{k=1}^{q} A_k e^{j\phi_{km}}\,\mathbf{S}_{\omega_k}. \qquad (12)$$


The array data (12) is random due to its dependency on the random phase angles $\phi_{km}$ and the contaminating noise $\eta_m(n)$. Assuming that these random variables are pairwise uncorrelated and statistically invariant with respect to the snapshot index $m$, it follows that each data vector $\mathbf{y}_m$ is a windowed realization of a wide-sense stationary random process. The mean value of this process is the zero vector,

$$E\{\mathbf{y}_m\} = \mathbf{0}, \qquad (13)$$

while its covariance matrix is specified by

$$R = E\{\mathbf{y}_m\mathbf{y}_m^\dagger\} = \sigma^2 I_p + \sum_{k=1}^{q} P_k\,\mathbf{S}_{\omega_k}\mathbf{S}_{\omega_k}^\dagger \qquad (14)$$

where $I_p$ is the $p \times p$ identity matrix and $P_k = |A_k|^2$ is the power of the $k$th incident plane wave. Since the random vector process is wide-sense stationary, the covariance matrix $R$ must be positive semi-definite, Toeplitz, and Hermitian.

We now describe three contemporary array processing methods and then present an algebraic approach to identifying the frequencies $\{\omega_k\}$, based upon the structure of the data and the associated covariance matrix $R$.
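The structure of the model covariance matrix (white noise plus a sum of rank-one sinusoid terms) can be checked numerically. This is a sketch only: the helper names and the parameter values are assumptions made for illustration.

```python
import numpy as np

def steering(omega, p):
    """Sinusoid vector [1, e^{j omega}, ..., e^{j(p-1) omega}]'."""
    return np.exp(1j * omega * np.arange(p))

def model_covariance(powers, omegas, p, sigma2):
    """R = sigma2 * I + sum_k P_k S_k S_k^dagger."""
    R = sigma2 * np.eye(p, dtype=complex)
    for P_k, w_k in zip(powers, omegas):
        s = steering(w_k, p)
        R += P_k * np.outer(s, s.conj())
    return R

# Hypothetical example: two equal-power sources, six sensors.
R = model_covariance([10.0, 10.0], [0.9, 1.1], p=6, sigma2=1.0)
is_hermitian = np.allclose(R, R.conj().T)
is_toeplitz = all(np.allclose(np.diag(R, k), np.diag(R, k)[0]) for k in range(-5, 6))
min_eig = np.linalg.eigvalsh(R).min()
print(is_hermitian, is_toeplitz, min_eig)
```

When $q < p$, the smallest eigenvalue of this matrix equals the noise variance $\sigma^2$, a fact that the eigenvector-based methods below exploit.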

Contemporary  Processing  Methods 


Wiener  Filter  Method 


The Wiener Filter method is based on filtering the data such that the signal-to-noise ratio at the filter output is maximized. It is essentially a linear prediction approach that is quite similar to the Maximum Entropy method of spectral estimation. Many adaptive array processing algorithms are equivalent to the Wiener Filter method, including Alam's orthonormal lattice filter algorithm. For the array processing problem an optimum weighting vector is obtained by

$$\mathbf{a} = \mu R^{-1}\mathbf{h} \qquad (15)$$

where $\mathbf{h} = [\,1\ 0\ 0\ \cdots\ 0\,]'$ and $\mu$ is an arbitrary scalar. As in the Maximum Entropy Method, the power spectrum may be computed by

$$P_{\mathrm{WF}}(\omega) = \frac{|\mu|^2}{\mathbf{S}_\omega^\dagger\,\mathbf{a}\,\mathbf{a}^\dagger\,\mathbf{S}_\omega}. \qquad (16)$$

Maximum  Likelihood  Method 


The Maximum Likelihood method is based on filtering the data such that power at the frequency of interest is passed and all other frequency components are rejected in an optimal manner. In our notation, the power spectrum is given by [6, 9, 10]

$$P_{\mathrm{ML}}(\omega) = \frac{1}{\mathbf{S}_\omega^\dagger R^{-1}\mathbf{S}_\omega}. \qquad (17)$$
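The Wiener Filter and Maximum Likelihood spectra can both be evaluated on a frequency grid once a covariance matrix is available. The sketch below assumes the exact single-source covariance and assumes that the scalar $\mu$ cancels in the Wiener spectrum (so it is set to 1); the function names and parameter values are illustrative choices, not taken from the paper.

```python
import numpy as np

def steering(omega, p):
    """Sinusoid vector [1, e^{j omega}, ..., e^{j(p-1) omega}]'."""
    return np.exp(1j * omega * np.arange(p))

def wiener_spectrum(R, grid):
    p = R.shape[0]
    h = np.zeros(p, dtype=complex)
    h[0] = 1.0
    a = np.linalg.solve(R, h)            # weighting vector with mu = 1 (assumed)
    # 1 / |S^dagger a|^2 at each grid frequency
    return np.array([1.0 / abs(np.vdot(steering(w, p), a)) ** 2 for w in grid])

def ml_spectrum(R, grid):
    p = R.shape[0]
    Rinv = np.linalg.inv(R)
    # 1 / (S^dagger R^{-1} S) at each grid frequency
    vals = [np.vdot(steering(w, p), Rinv @ steering(w, p)).real for w in grid]
    return 1.0 / np.array(vals)

# Hypothetical scenario: one source at omega = 1.0 rad, 20 dB above the noise.
p, sigma2, P1, w1 = 6, 1.0, 100.0, 1.0
s1 = steering(w1, p)
R = sigma2 * np.eye(p) + P1 * np.outer(s1, s1.conj())
grid = np.linspace(-np.pi, np.pi, 2001)
w_wiener = grid[np.argmax(wiener_spectrum(R, grid))]
w_ml = grid[np.argmax(ml_spectrum(R, grid))]
print(w_wiener, w_ml)
```

Both spectra peak at the grid point nearest the true spatial frequency in this idealized case; differences between the methods appear with estimated covariances and closely spaced sources.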


Pisarenko  Method 


The Pisarenko method has not found as widespread use as the Wiener Filter method and the Maximum Likelihood method. Haykin recently applied part of the Pisarenko method to the array processing problem via a special autoregressive-moving average (ARMA) model. The complete Pisarenko method is based on a theorem of Caratheodory that allows decomposition of the exact truncated covariance sequence $r(n)$, $0 \le n \le q-1$, into a positively-weighted sum of $q$ complex sinusoids and white noise. The method has three steps:

(i) identifying and removing the noise contribution to the covariance matrix,

(ii) forming the $q \times q$ covariance matrix and analyzing the single eigenvector corresponding to the unique minimum eigenvalue $\lambda_{\min} = 0$ to determine the sinusoid frequencies, and

(iii) solving a set of $q$ simultaneous linear equations for the sinusoid powers.

Algebraic  Processing  Approach 

We now formulate a generalized minimization problem which suggests several particular methods. Under different constraints, the solution of this problem encompasses each of the methods in the previous section.

Let us consider a general nontrivial $p \times 1$ coefficient vector $\mathbf{a}$ that is orthogonal to the noise-free component of each of the data vectors $\mathbf{y}_m$. From (12), this orthogonality is defined by the inner product relationship

$$0 = \Big\langle \mathbf{a},\ \sum_{k=1}^{q} A_k e^{j\phi_{km}}\,\mathbf{S}_{\omega_k} \Big\rangle, \qquad 1 \le m \le M. \qquad (18)$$

Since the $\{\omega_k\}$ are all different and the $\{\phi_{km}\}$ are random in nature, a little thought will convince oneself that $\mathbf{a}$ must be orthogonal to each of the $q$ sinusoid vectors $\mathbf{S}_{\omega_k}$, $1 \le k \le q$. We next define the general z-transform $A(z)$ of the coefficient vector $\mathbf{a}$ by

$$A(z) = \langle \mathbf{a}, \mathbf{z}^* \rangle \qquad (19)$$

where $\mathbf{z} = [\,1\ \ z^{-1}\ \ z^{-2}\ \ \cdots\ \ z^{-(p-1)}\,]'$. It is then readily shown that the orthogonality of $\mathbf{a}$ to each $\mathbf{S}_{\omega_k}$, $1 \le k \le q$, implies that $A(z)$ must have $q$ finite zeros located on the unit circle at the points $z_k = e^{j\omega_k}$, $1 \le k \le q$. With this in mind, the required frequencies can be determined by examination of the zeros of $A(z)$.
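The link between orthogonality and zeros on the unit circle can be illustrated in the noise-free case. The sketch below assumes the inner product $\langle \mathbf{a}, \mathbf{b} \rangle = \mathbf{b}^\dagger\mathbf{a}$, so that $A(z) = \sum_n a_n z^{-n}$; the values of $p$ and the frequencies are hypothetical.

```python
import numpy as np

# Hypothetical case: p = 3 sensors, q = 2 sinusoids.
p = 3
omegas = np.array([0.7, 1.3])
S = np.exp(1j * np.outer(np.arange(p), omegas))   # columns are the sinusoid vectors

# Null vector of S^dagger: the last right-singular vector of the (q x p) matrix.
_, _, Vh = np.linalg.svd(S.conj().T)
a = Vh[-1].conj()

# A(z) = a_0 + a_1 z^{-1} + a_2 z^{-2}; its zeros are the roots of
# a_0 z^2 + a_1 z + a_2, which np.roots computes from the same coefficients.
zeros = np.roots(a)
est = np.sort(np.angle(zeros))
print(np.abs(zeros), est)
```

Here $p - 1 = q$, so every zero of $A(z)$ is a signal zero; with more sensors, additional "noise zeros" appear and must be separated from the signal zeros.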


In the idealized noise-free case, the snapshot vectors $\mathbf{y}_m$ are members of the $q$-dimensional subspace spanned by the $q$ linearly independent sinusoid vectors $\mathbf{S}_{\omega_k}$, $1 \le k \le q$. Thus, for $q < p$ there always exists a $p \times 1$ vector $\mathbf{a}$ that is orthogonal to the noise-free components of the snapshot vectors $\mathbf{y}_m$. If $q \ge p$, there generally does not exist such a vector $\mathbf{a}$. Furthermore, when noise is present, even if $q < p$, there generally does not exist a vector $\mathbf{a}$ that is orthogonal to each of the noise-contaminated snapshot vectors $\mathbf{y}_m$. Nonetheless, it is intuitively desirable to select a coefficient vector which is nearly orthogonal to each of the snapshot vectors in some well-defined manner, and to determine the plane wave sinusoidal frequencies by examination of the zeros of the z-transform of this coefficient vector. A convenient method to evaluate these zero locations is to search for nulls in the magnitude of the Fourier transform of the coefficient vector. Since there can be more zeros than plane waves (i.e., $p-1 > q$), we can estimate the $\{P_k\}$ in order to separate "signal zeros" from "noise zeros".

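To obtain a mathematical measure of closeness to orthogonality, it is beneficial to define an orthogonality error vector $\mathbf{e}(\mathbf{a})$ whose $m$th element is the inner product of $\mathbf{a}$ and $\mathbf{y}_m$, denoted by $\langle \mathbf{a}, \mathbf{y}_m \rangle$. We define the optimum $\mathbf{a}$ to be the vector $\mathbf{a}^\circ$ that minimizes some positive definite functional $f$ of $\mathbf{e}(\mathbf{a})$. Hence we write

$$\mathbf{e}(\mathbf{a}) = [\,e(1)\ \ e(2)\ \ \cdots\ \ e(M)\,]' \quad\text{where}\quad e(m) = \langle \mathbf{a}, \mathbf{y}_m \rangle \qquad (20)$$

and

$$f(\mathbf{e}(\mathbf{a}^\circ)) = \min_{\mathbf{a} \in \mathcal{A}} f(\mathbf{e}(\mathbf{a})) \qquad (21)$$

where $\mathcal{A}$ is a constraint set for the solution vector $\mathbf{a}^\circ$. A constraint is generally necessary for the minimization to be well-defined, that is, for $\mathbf{a}^\circ$ to be unique and nontrivial.

We must choose an inner product for (20) and an error functional $f$ for (21). Let us choose in particular the standard vector inner product $\langle \mathbf{a}, \mathbf{y}_m \rangle = \mathbf{y}_m^\dagger\mathbf{a}$, in which case we have

$$e(m) = \mathbf{y}_m^\dagger\mathbf{a}, \qquad 1 \le m \le M. \qquad (22)$$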


A convenient positive definite functional for an error vector is the mean square error criterion

$$f(\mathbf{e}) = E\{\|\mathbf{e}\|^2\} \qquad (23)$$

where $\|\mathbf{e}\| = \sqrt{|e(1)|^2 + \cdots + |e(M)|^2}$ is the Euclidean norm of $\mathbf{e}$. It will be computationally expedient to normalize this criterion by the length $M$ of the vector $\mathbf{e}$. From (22) and (23) we have

$$\frac{1}{M} f(\mathbf{e}(\mathbf{a})) = \mathbf{a}^\dagger R\,\mathbf{a} \qquad (24)$$

where $R$ is the covariance matrix defined in (14). Applying (21), the quadratic form (24) must now be minimized according to some constraint that causes $\mathbf{a}^\circ$ to be unique and nontrivial. Next we consider two possible constraints, a linear constraint and a quadratic constraint.

Linear  Constraint 

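The first constraint is that $\mathbf{a}^\circ$ lies on a hyperplane specified by

$$\mathcal{A} = \{\mathbf{a} \in C^p : \mathbf{a}^\dagger\mathbf{h} + \mathbf{h}^\dagger\mathbf{a} = 2\} \qquad (25)$$

where $\mathbf{h}$ is a nontrivial $p \times 1$ vector that characterizes the orientation of the hyperplane. The solution to (21) with this constraint is given by

$$\mathbf{a}^\circ = \frac{R^{-1}\mathbf{h}}{\mathbf{h}^\dagger R^{-1}\mathbf{h}} \qquad (26)$$

and the criterion's minimum value is

$$f_{\min} = \frac{1}{\mathbf{h}^\dagger R^{-1}\mathbf{h}}. \qquad (27)$$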
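The linear-constraint solution is simple to compute and to verify. The sketch below is an illustration under assumptions: the function name is hypothetical and the covariance matrix is a random Hermitian positive definite matrix rather than array data.

```python
import numpy as np

def linear_constraint_solution(R, h):
    """Return (a_opt, f_min) with a_opt = R^{-1} h / (h^dagger R^{-1} h)."""
    Rinv_h = np.linalg.solve(R, h)
    denom = np.vdot(h, Rinv_h).real      # h^dagger R^{-1} h, real and positive for R > 0
    return Rinv_h / denom, 1.0 / denom

# Hypothetical test matrix: random Hermitian positive definite R.
rng = np.random.default_rng(0)
X = rng.standard_normal((5, 5)) + 1j * rng.standard_normal((5, 5))
R = X @ X.conj().T + np.eye(5)
h = np.zeros(5, dtype=complex)
h[0] = 1.0                               # the linear-prediction choice of h

a_opt, f_min = linear_constraint_solution(R, h)
constraint = (np.vdot(a_opt, h) + np.vdot(h, a_opt)).real   # should equal 2
cost = np.vdot(a_opt, R @ a_opt).real                       # should equal f_min
print(constraint, cost, f_min, a_opt[0])
```

With $\mathbf{h} = [1\ 0\ \cdots\ 0]'$ the first element of the solution is fixed at 1, as in a linear predictor.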

Quadratic Constraint

The second constraint is that $\mathbf{a}^\circ$ lies on the quadratic surface specified by

$$\mathcal{A} = \{\mathbf{a} \in C^p : \mathbf{a}^\dagger W\mathbf{a} = 1\} \qquad (28)$$

where $W$ is a positive definite, Hermitian matrix which characterizes the quadratic surface. The solution to (21) with this constraint is given by

$$\mathbf{a}^\circ = \frac{\mathbf{x}_{\min}}{\sqrt{\mathbf{x}_{\min}^\dagger W\,\mathbf{x}_{\min}}} \qquad (29)$$

and the criterion's minimum value is

$$f_{\min} = \lambda_{\min} \qquad (30)$$

where $(\lambda_{\min}, \mathbf{x}_{\min})$ is the minimum-eigenvalue and eigenvector pair of $W^{-1}R$.

The above solutions encompass the three contemporary methods described earlier. For $\mathbf{h} = [\,1\ 0\ \cdots\ 0\,]'$, Equation (26) is precisely the Wiener Filter solution (15) with $\mu = 1/(\mathbf{h}^\dagger R^{-1}\mathbf{h})$. Just as in linear prediction, this constraint implies that the first element of $\mathbf{a}^\circ$ is fixed at 1 and the other elements are unconstrained. For $\mathbf{h} = \mathbf{S}_\omega$, (27) is precisely the Maximum Likelihood solution. This constraint requires $A^\circ(z)$ to have unity gain at $z = e^{j\omega}$, while the minimization strategy optimally reduces the gain at other frequencies. For $W = I_p$, the quadratic surface is the hypersphere of radius one, and (29) is a generalized version of the Pisarenko method. There are several differences. First, no special ARMA model is invoked, as is done by Haykin. Second, neither noise power removal nor matrix order reduction is required, as they are in the Pisarenko method. Third, this method is based upon a minimization strategy and so justifies estimates, generally even non-Toeplitz, of the covariance matrix $R$. In the special case of a Toeplitz estimate matrix, a power identification technique similar to the Pisarenko method can be employed, as is shown later. Fourth, the general constraint matrix $W$ allows greater flexibility in the solution. The quadratic constraint solution generalizes the Pisarenko method and extends it to the multiple-snapshot array processing problem.

Extension to Correlated Noise

Through selection of the constraint set, the algebraic approach extends quite readily to the case in which the contaminating noise is correlated. Such is the case when the noise is due not only to sensor measurement noise but also to a directional background noise field in the array environment. Note that any undesired signal (jamming interference, for example) may be considered as correlated noise.

Generalizing (14), we have that the data covariance matrix for the case of correlated noise is given by

$$R = \sigma^2 B + \sum_{k=1}^{q} P_k\,\mathbf{S}_{\omega_k}\mathbf{S}_{\omega_k}^\dagger \qquad (31)$$

where the noise covariance matrix $B$ is defined by

$$\sigma^2 B = E\{\boldsymbol{\eta}_m\boldsymbol{\eta}_m^\dagger\}. \qquad (32)$$

We assume that the shape of the noise spectrum is known, which implies knowledge of $B$.

Returning to the quadratic constraint (29,30), we note that $(\lambda_{\min}, \mathbf{x}_{\min})$ is the solution to

$$(R - \lambda_{\min} W)\,\mathbf{x}_{\min} = \mathbf{0}. \qquad (33)$$

We have seen that for the choice $W = I_p$ we have the Pisarenko solution. In this case it is well known that $\lambda_{\min} = \sigma^2$ and that we essentially have a white-noise power cancellation algorithm. Hence it is a simple step to choose

$$W = B \qquad (34)$$

and achieve a colored-noise correlation cancellation algorithm. This step can be justified further by rewriting (31) as

$$(R - \sigma^2 B) = \sum_{k=1}^{q} P_k\,\mathbf{S}_{\omega_k}\mathbf{S}_{\omega_k}^\dagger \qquad (35)$$

and remembering that we seek a vector that is orthogonal to the sinusoid vectors $\{\mathbf{S}_{\omega_k}\}$. Also, it is apparent from the constraint $\mathbf{a}^\dagger B\,\mathbf{a} = 1$ that we are again specifying a set of vectors with constant norm, but now the norm is determined by the noise covariance matrix $B$.

The linear constraint (26,27) may also be extended for correlated noise. We shall only consider the Wiener solution, since it has been shown to achieve greater resolution than the Maximum Likelihood Method. We will show later that to extend the Wiener linear prediction solution, a reasonable constraint $\mathbf{h}$ given $B$ is

$$\mathbf{h} = B\,[\,1\ 0\ \cdots\ 0\,]'. \qquad (36)$$

Note that we no longer have a linear predictor.
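The quadratic-constraint solution with $W = B$ amounts to a generalized Hermitian eigenproblem. The sketch below solves it by a Cholesky reduction, which is one convenient numerical route and not necessarily the one the authors used; the function name, the noise shape $B_{ij} = 0.5^{|i-j|}$, and all parameter values are assumptions for illustration.

```python
import numpy as np

def quadratic_constraint_solution(R, W):
    """Minimum eigenpair of R x = lambda W x, normalized so that a^dagger W a = 1."""
    L = np.linalg.cholesky(W)                 # W = L L^dagger
    Linv = np.linalg.inv(L)
    C = Linv @ R @ Linv.conj().T              # Hermitian; same eigenvalues as W^{-1} R
    lam, V = np.linalg.eigh(C)                # eigenvalues in ascending order
    a = Linv.conj().T @ V[:, 0]               # back-transform the unit-norm eigenvector
    return a, lam[0]

# Hypothetical scenario: 3 sensors, 2 sources, exponentially correlated noise.
p, sigma2 = 3, 1.0
omegas = np.array([0.7, 1.3])
powers = np.array([5.0, 2.0])
idx = np.arange(p)
B = 0.5 ** np.abs(idx[:, None] - idx[None, :])
S = np.exp(1j * np.outer(idx, omegas))
R = sigma2 * B + (S * powers) @ S.conj().T    # sigma2*B + sum_k P_k S_k S_k^dagger

a_opt, lam_min = quadratic_constraint_solution(R, B)
angles = np.sort(np.angle(np.roots(a_opt)))
print(lam_min, angles)
```

With exact covariance values the minimum generalized eigenvalue equals $\sigma^2$ and the zeros of the solution fall at the source frequencies, which is the colored-noise cancellation property described above.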

Example 

To illustrate the linear and quadratic constraint solutions, let us consider the case of a single plane wave of power $P_1$ and spatial frequency $\omega_1$ incident on an array of two sensors in a correlated noise field. For this case the noise covariance matrix is given by

$$\sigma^2 B = \sigma^2 \begin{bmatrix} 1 & b^* \\ b & 1 \end{bmatrix} \qquad (37)$$

where $|b| < 1$, and the data vector covariance matrix by

$$R = \begin{bmatrix} P_1 + \sigma^2 & P_1 e^{-j\omega_1} + \sigma^2 b^* \\ P_1 e^{j\omega_1} + \sigma^2 b & P_1 + \sigma^2 \end{bmatrix}. \qquad (38)$$

First we consider the linear and quadratic constraint solutions without accounting for the correlated noise (i.e., the Wiener and Pisarenko methods). For the linear constraint we choose $\mathbf{h} = [\,1\ 0\,]'$ and find from (26) that

$$\mathbf{a}^\circ = \begin{bmatrix} 1 \\ -\dfrac{P_1 e^{j\omega_1} + \sigma^2 b}{P_1 + \sigma^2} \end{bmatrix}. \qquad (39)$$

Defining the signal-to-noise ratio by $\mathrm{SNR} = P_1/\sigma^2$, we see that $A^\circ(z)$ has a zero located at

$$z = \left[\frac{\mathrm{SNR} + b\,e^{-j\omega_1}}{\mathrm{SNR} + 1}\right] e^{j\omega_1}. \qquad (40)$$

As has been noted elsewhere, this linear-prediction solution suffers from zero migration away from the unit circle even when the noise is white (i.e., $b = 0$). This migration degrades resolution, since applying the Fourier transform to evaluate zero locations may indicate only a single null when in fact there are two zeros close together somewhat off the unit circle. Furthermore, we see there is a frequency bias introduced by the correlated noise. This bias becomes greater as the signal-to-noise ratio decreases.

For the quadratic constraint we choose $W = I$ and find from (29) that

$$\mathbf{a}^\circ = \frac{1}{\sqrt{2}} \begin{bmatrix} 1 \\ -\dfrac{P_1 e^{j\omega_1} + \sigma^2 b}{|P_1 e^{j\omega_1} + \sigma^2 b|} \end{bmatrix}. \qquad (41)$$

Thus $A^\circ(z)$ has a zero located at

$$z = \frac{\mathrm{SNR}\,e^{j\omega_1} + b}{|\mathrm{SNR}\,e^{j\omega_1} + b|}. \qquad (42)$$

We see in this case that the zero lies directly on the unit circle, regardless of the signal-to-noise ratio. In fact, when the noise is white, the zero perfectly indicates the plane wave spatial frequency $\omega_1$. However, when the noise is correlated, there is a frequency bias present that again becomes greater as the signal-to-noise ratio decreases.

Let us consider now the linear and quadratic constraint solutions which account for the correlated noise. For the linear constraint we choose $\mathbf{h} = B\,[\,1\ 0\,]'$ and find that

$$\mathbf{a}^\circ = \mu \begin{bmatrix} 1 \\ -\dfrac{P_1 (e^{j\omega_1} - b)}{P_1 (1 - b\,e^{-j\omega_1}) + \sigma^2 (1 - |b|^2)} \end{bmatrix} \qquad (43)$$

where $\mu$ is a scalar function of $P_1$, $\omega_1$, $\sigma^2$, and $b$. Thus $A^\circ(z)$ has a zero located at

$$z = \left[\frac{\mathrm{SNR}\,(1 - b\,e^{-j\omega_1})}{\mathrm{SNR}\,(1 - b\,e^{-j\omega_1}) + 1 - |b|^2}\right] e^{j\omega_1}. \qquad (44)$$

As in (40), we see that pure white noise will still cause the zero to migrate away from the unit circle, and that correlated noise will introduce frequency bias. However, as the noise becomes "more correlated" (i.e., $|b| \to 1$), the zero moves closer to the unit circle and asymptotically indicates the exact plane wave spatial frequency $\omega_1$, regardless of the signal-to-noise ratio. Note that the effect of an interfering harmonic source (i.e., $b = e^{j\omega_2}$) is completely removed.

For the quadratic constraint we choose $W = B$ and find that

$$\mathbf{a}^\circ = \rho \begin{bmatrix} 1 \\ -e^{j\omega_1} \end{bmatrix} \qquad (45)$$

where $\rho$ is a scalar function of $\omega_1$ and $b$. Thus $A^\circ(z)$ has a zero located at

$$z = e^{j\omega_1}. \qquad (46)$$

We see that the zero indicates the exact plane wave spatial frequency $\omega_1$, regardless of the signal-to-noise ratio or the particular value of $b$. For this reason, we expect the quadratic-constraint solution to obtain high resolution.
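The four zero locations of this two-sensor example can be checked numerically. The sketch assumes the noise shape $B = [[1, b^*], [b, 1]]$ and uses hypothetical values for $P_1$, $\sigma^2$, $\omega_1$, and $b$; the helper names are illustrative.

```python
import numpy as np

# Hypothetical values for the two-sensor example.
P1, sigma2, w1, b = 4.0, 1.0, 0.8, 0.5 * np.exp(0.3j)
snr = P1 / sigma2
B = np.array([[1.0, np.conj(b)], [b, 1.0]])
s = np.array([1.0, np.exp(1j * w1)])
R = sigma2 * B + P1 * np.outer(s, s.conj())

def zero_of(a):
    """A(z) = a_0 + a_1 z^{-1} vanishes at z = -a_1 / a_0."""
    return -a[1] / a[0]

def min_gen_eigvec(R, W):
    """Eigenvector of W^{-1} R belonging to its smallest eigenvalue."""
    lam, V = np.linalg.eig(np.linalg.solve(W, R))
    return V[:, np.argmin(lam.real)]

e1 = np.array([1.0, 0.0], dtype=complex)
z_wiener = zero_of(np.linalg.solve(R, e1))              # linear, h = [1 0]'
z_pisarenko = zero_of(min_gen_eigvec(R, np.eye(2)))     # quadratic, W = I
z_lin_B = zero_of(np.linalg.solve(R, B @ e1))           # linear, h = B [1 0]'
z_quad_B = zero_of(min_gen_eigvec(R, B))                # quadratic, W = B

for name, z in [("Wiener", z_wiener), ("Pisarenko", z_pisarenko),
                ("linear, h = B e1", z_lin_B), ("quadratic, W = B", z_quad_B)]:
    print(f"{name:18s} |z| = {abs(z):.4f}  angle = {np.angle(z):.4f}")
```

Scaling of the solution vectors does not affect the zero, so the normalizations in the constrained solutions are omitted here.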

To summarize the development to this point, the algebraic approach is based on approximating an orthogonality condition between a solution vector and each of the data vectors. This approach encompasses three contemporary array processing methods and readily extends to the case of correlated noise.


Implementation of the Quadratic-Constraint Solution

In the previous section we saw that the quadratic-constraint solution (29,30) is a promising array processing method in terms of its perfect resolution given exact covariance values. However, it requires an eigenvalue-eigenvector computation that seems to be quite burdensome. Fortunately, a simple recursive algorithm can be derived using the nature of the array processing problem.

First we recall the standard "inverse iteration" method for finding the minimum eigenvalue and eigenvector pair of a complex matrix $D$. Consider the sequence of vectors $\{\mathbf{x}_k\}$ defined by

$$D\,\mathbf{x}_k = \mathbf{x}_{k-1} \qquad (47)$$

where $\mathbf{x}_0$ is nonzero and arbitrary. As $k$ increases, we have (to within a scale factor on $\mathbf{x}_k$)

$$\mathbf{x}_k \to \mathbf{x}_{\min} \quad\text{and}\quad \frac{\mathbf{x}_k^\dagger\mathbf{x}_{k-1}}{\mathbf{x}_k^\dagger\mathbf{x}_k} \to \lambda_{\min}.$$

This method is appropriate to the array processing problem, in which the data arrives sequentially. Assume that from $M$ snapshots we have estimated the covariance matrix by $R_M$ and obtained the desired pair $(\lambda_M, \mathbf{x}_M)$. When the next snapshot is available, we form $R_{M+1}$ and compute $\mathbf{x}_{M+1}$ from (47) using $D = R_{M+1}$ and $\mathbf{x}_{k-1} = \mathbf{x}_M$. Since the inverse iteration method generally has fast convergence, a single iteration of (47) for $\mathbf{x}_{M+1}$ may be sufficient as long as $R_M$ is only slowly time-varying.

To accelerate convergence, we can apply the "inverse iteration of Wielandt" wherein an approximation of $\lambda_{\min}$ is subtracted from the main diagonal of $D$ before iterating. Given $R_{M+1}$, we use $\lambda_M$ to approximate $\lambda_{M+1}$. The iteration is given by

$$(R_{M+1} - \lambda_M I_p)\,\mathbf{x}_{M+1} = \mathbf{x}_M, \qquad \mathbf{x}_{M+1} \to \mathbf{x}_{\min}, \qquad \lambda_M + \frac{\mathbf{x}_{M+1}^\dagger\mathbf{x}_M}{\mathbf{x}_{M+1}^\dagger\mathbf{x}_{M+1}} = \lambda_{M+1} \to \lambda_{\min}. \qquad (48)$$

For a Toeplitz and Hermitian covariance matrix estimate, each iteration can be performed with $O(p^2)$ multiply-adds using Zohar's algorithm. An alternate algorithm by Gueguen can be used to avoid numerical difficulties that may be associated with this computation.

For the case of correlated noise, we have $D = B^{-1}R_{M+1}$. We suggest generalizing the accelerated iteration to avoid calculation of $B^{-1}$. Our general iteration is given by

$$(R_{M+1} - \lambda_M W)\,\mathbf{x}_{M+1} = W\mathbf{x}_M, \qquad \mathbf{x}_{M+1} \to \mathbf{x}_{\min}, \qquad \lambda_M + \frac{\mathbf{x}_{M+1}^\dagger W\mathbf{x}_M}{\mathbf{x}_{M+1}^\dagger W\mathbf{x}_{M+1}} = \lambda_{M+1} \to \lambda_{\min}. \qquad (49)$$

Each iteration still only requires $O(p^2)$ multiply-adds, and $B^{-1}$ is never calculated.

In some applications, it may not be desirable to calculate the solution vector after every snapshot. For instance, forming a new covariance matrix estimate and calculating a new solution only every $L$ snapshots reduces the average computation rate by a factor of $L$. Unfortunately, if the new covariance matrix estimate differs considerably from the previous, the previous eigenvalue may be a poor approximation to $\lambda_{\min}$ and convergence will be slowed. To obtain a better eigenvalue approximation, we apply perturbation techniques. Suppose that $D$ and $W$ are Hermitian matrices and that we have solved the eigenvalue-eigenvector problem

$$D\mathbf{x} = \lambda W\mathbf{x}. \qquad (50)$$

Applying a Hermitian perturbation $\delta D$ to $D$ we have the new problem

$$(D + \delta D)(\mathbf{x} + \delta\mathbf{x}) = (\lambda + \delta\lambda)\,W(\mathbf{x} + \delta\mathbf{x}). \qquad (51)$$

It can be shown that $\delta\lambda$ is approximately given by

$$\delta\lambda \approx \frac{\mathbf{x}^\dagger(\delta D)\,\mathbf{x}}{\mathbf{x}^\dagger W\mathbf{x}}. \qquad (52)$$

Our approximation to the new $\lambda_{\min}$ is then given by the sum of $\delta\lambda$ and the previous $\lambda_{\min}$. With appropriate definitions, this approximation replaces $\lambda_M$ in (49) when we expect the new covariance matrix estimate to differ considerably from the previous estimate.

Relationship to Linear-Constraint Solution

The iterative implementation of the quadratic-constraint solution gives insight into the linear-constraint solution. Namely, the first iteration of (47) with $D = R$ and $\mathbf{x}_0 = \mathbf{h}$ yields the linear-constraint solution of (26) within a constant of proportionality. Repeated calculation of the linear-constraint solution, with $\mathbf{h}$ at each step equal to $\mathbf{a}^\circ$ of the previous step, is in fact an iterative implementation of the quadratic-constraint solution. It is apparent that at each step, the constraining hyperplane is realigned according to the estimated solution. With these insights, it is reasonable to choose

$$\mathbf{h} = B\,[\,1\ 0\ \cdots\ 0\,]'$$

as the linear constraint for correlated noise, since it yields the first step of the iterative quadratic-constraint solution (without acceleration) for correlated noise given in (49). This justifies the choice made earlier in (36).

In this section we have presented a recursive algorithm (49) for implementing the quadratic-constraint solution. The algorithm makes use of the sequential nature of the snapshot data to efficiently employ inverse iteration. The algorithm includes the case of correlated noise. A modification to the algorithm (52) was presented for the case where successive covariance matrix estimates differ considerably.

Covariance Matrix Estimate

To employ the proposed processing methods, an estimate of the covariance matrix is required. From this estimate, a solution vector is obtained and the zeros of the vector's z-transform examined to determine the plane wave spatial frequencies. Given a $p \times 1$ solution vector and $q$ plane waves, $q < p$, there will be $q$ "signal" zeros and $p-q-1$ "noise" zeros. These zeros must be separated from one another. It is well known that in the linear prediction solution, dominant frequency components will generate zeros closer to the unit circle than less powerful components; thus, a simple way to evaluate signal zero locations is to search for nulls in the solution vector's Fourier transform. For the quadratic solution in white noise, it can be shown that all of the zeros will be on the unit circle when the covariance matrix estimate is both Hermitian and Toeplitz. Thus the estimated frequencies can be directly employed in a power determination technique and the zeros separated on a basis of signal power as before.

A standard covariance matrix estimate is

$$\hat{R}_M = \frac{1}{M}\sum_{m=1}^{M} \mathbf{y}_m\mathbf{y}_m^\dagger. \qquad (53)$$

This estimate is unbiased and Hermitian, but in general not Toeplitz. Furthermore, only one lag product from each data vector is used in formulating each element of $\hat{R}_M$. An alternate estimate is the matrix $\tilde{R}_M$ whose elements are given by

$$\tilde{R}_M(i,j) = c(i-j), \qquad 1 \le i,j \le p \qquad (54)$$

where

$$c(n) = \frac{1}{M(p-n)}\sum_{m=1}^{M}\sum_{i=0}^{p-n-1} y_m(i+n)\,y_m^*(i), \qquad 0 \le n \le p-1,$$

$$c(n) = c^*(-n), \qquad -p+1 \le n < 0.$$

This estimate is unbiased, Hermitian, and Toeplitz. Also, $p-n$ lag products from each data vector are used in formulating each element $c(n)$. Thus the estimate of (54) has lower variance than that of (53).
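The two covariance estimates can be compared with a short sketch. It assumes the snapshots are stored as rows of an $(M, p)$ array and that the Toeplitz estimate averages the available lag products with the unbiased normalization $1/(M(p-n))$; the function names and the random test data are illustrative.

```python
import numpy as np

def sample_covariance(Y):
    """Average of outer products y_m y_m^dagger; Y has shape (M, p)."""
    M = Y.shape[0]
    return (Y.T @ Y.conj()) / M

def toeplitz_covariance(Y):
    """Hermitian Toeplitz estimate built from averaged lag products c(n)."""
    M, p = Y.shape
    c = np.empty(p, dtype=complex)
    for n in range(p):
        # p - n lag products y(i+n) y*(i) per snapshot, averaged over i and m
        c[n] = np.sum(Y[:, n:] * Y[:, :p - n].conj()) / (M * (p - n))
    idx = np.arange(p)
    lag = idx[:, None] - idx[None, :]          # i - j
    return np.where(lag >= 0, c[np.abs(lag)], c[np.abs(lag)].conj())

# Hypothetical data: 20 random complex snapshots from 5 sensors.
rng = np.random.default_rng(0)
Y = rng.standard_normal((20, 5)) + 1j * rng.standard_normal((20, 5))
R_hat = sample_covariance(Y)
R_tilde = toeplitz_covariance(Y)
print(np.round(R_tilde[:, 0], 3))
```

Each diagonal of the Toeplitz estimate equals the average of the corresponding diagonal of the standard estimate, which is the sense in which more lag products enter each element.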

In the following simulations, the standard non-Toeplitz estimate (53) will be used in the linear-constraint solution in order to compare with previous simulations. However, for the quadratic-constraint solution the Toeplitz structure is important, hence the Toeplitz estimate $\tilde{R}_M$ (54) will be used.

Simulation  Results 

To compare the performance of these two processing methods, the data vectors (12) were generated by computer simulation. The simulation model corresponded to that chosen by Gabriel in his comparative paper. Namely, the case of two sources incident on an array with white noise was considered. The parameter selections were $q = 2$, $p = 8$, $\sigma^2 = 1$, $A_1 = A_2 = 31.62$ (30 dB SNR) and $3.162$ (10 dB SNR), $\theta_1 = 18°$, $\theta_2 = 22°$, $M = 50$ (many snapshots) and $10$ (few snapshots), and $d = \lambda/2$.

With white noise, the linear solution and the quadratic solution correspond to the Wiener and Pisarenko methods. The simulation results are shown in Figure 3. In this figure, the linear solution has been evaluated via its Fourier transform and the quadratic solution via the power determination technique. Overlaid solutions for ten different realizations of the random data are shown to give a sense of each method's consistency.
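The recursive iteration of the implementation section can be exercised on the geometry of this simulation. The sketch below is deliberately simplified and rests on several assumptions: it uses the exact model covariance for the 30 dB case instead of random snapshots, it holds the shift fixed at a hypothetical value of 0.5 rather than updating it per snapshot, and it takes the two zeros nearest the unit circle as the signal zeros. The function name is illustrative.

```python
import numpy as np

def general_inverse_iteration(R, W, x0, lam_shift, n_iter):
    """Iterate (R - lam_shift*W) x_new = W x with a fixed shift (a simplification)."""
    A = R - lam_shift * W
    x = x0 / np.linalg.norm(x0)
    lam = lam_shift
    for _ in range(n_iter):
        x_new = np.linalg.solve(A, W @ x)
        # Eigenvalue estimate: shift plus the ratio x_new^dagger W x / x_new^dagger W x_new
        lam = lam_shift + (np.vdot(x_new, W @ x) / np.vdot(x_new, W @ x_new)).real
        x = x_new / np.linalg.norm(x_new)
    return x, lam

# Geometry of the simulation: p = 8, d = lambda/2, sources at 18 and 22 degrees,
# unit noise variance, source power 1000 (30 dB). Exact covariance is assumed.
p, sigma2, power = 8, 1.0, 1000.0
omegas = np.pi * np.sin(np.deg2rad([18.0, 22.0]))
S = np.exp(1j * np.outer(np.arange(p), omegas))
R = sigma2 * np.eye(p) + power * (S @ S.conj().T)

x0 = np.zeros(p, dtype=complex)
x0[0] = 1.0
x, lam = general_inverse_iteration(R, np.eye(p), x0, lam_shift=0.5, n_iter=5)

zeros = np.roots(x)                                    # zeros of the solution's z-transform
nearest = np.argsort(np.abs(np.abs(zeros) - 1.0))[:2]  # two zeros closest to the unit circle
est_deg = np.sort(np.rad2deg(np.arcsin(np.angle(zeros[nearest]) / np.pi)))
print(lam, est_deg)
```

With estimated covariance matrices the noise eigenvalues are no longer equal, so convergence depends on how well the shift approximates the minimum eigenvalue; the sketch does not attempt to reproduce the results of Figure 3.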

These results show that both methods work well at high SNR with many snapshots. However, the linear solution performs poorly at low SNR with few snapshots, while the quadratic solution continues to give good resolution and good suppression of spurious effects. In general, the quadratic solution has shown better performance than the linear solution over a wide range of conditions. Further simulations are underway to compare the performance of the methods in correlated noise and to evaluate the recursive algorithm presented above. The results will be given at the conference.

Conclusions 

We have detailed an algebraic approach to array processing based upon approximation of an orthogonality condition. The approach encompasses several contemporary, high resolution methods. Previous results were extended to the case of correlated noise, and a recursive algorithm was presented for the quadratic-constraint solution. The quadratic-constraint solution appears to be particularly effective and suggests further investigation of eigen-analysis array processing methods and their implementation.

Figure 3. Two-source simulation with sources at 18 and 22 degrees. Linear solution: (A) 30 dB SNR, 50 snapshots; (B) 10 dB SNR, 10 snapshots; (C) single trial from (B). Quadratic solution: (D) 30 dB SNR, 50 snapshots; (E) 10 dB SNR, 10 snapshots; (F) single trial from (E).